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Numerical optimization is used to design linear-optical devices that implement a desired quantum 
gate with perfect fidelity, while maximizing the success rate. For the 2-qubit CS (or CNOT) gate, 
we provide numerical evidence that the maximum success rate is S = 2/27 using two unentangled 
ancilla resources; interestingly, additional ancilla resources do not increase the success rate. For the 
3-qubit Toffoli gate, we show that perfect fidelity is obtained with only three unentangled ancilla 
photons - less than in any existing scheme - with a maximum S — 0.00340. This compares well 
with S = (2/27) 2 /2 « 0.00274, obtainable by combining two CNOT gates and a passive quantum 
filter pQ. The general optimization approach can easily be applied to other areas of interest, such 
as quantum error correction, cryptography, and metrology [2j|3]. 
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Linear optics is considered as a viable method for 
scalable quantum information processing, due in large 
part to the seminal work of Knill, Laflamme, and Mil- 
burn (KLM) [4]. These authors showed that an elemen- 
tary quantum logic gate on qubits, encoded in photonic 
states, can be constructed using a combination of linear- 
optical elements and quantum measurement. The trade- 
off in this measurement-assisted scheme is that the gate is 
properly implemented only when the measurement yields 
a positive outcome, i.e., the gate is non-deterministic. 
Soon after the KLM scheme became a paradigm for linear 
optical quantum computation (LOQC), it became clear 
that there is a general unresolved theoretical problem of 
finding the optimal implementation for a desired quan- 
tum transformation [5]. 

For the nonlinear sign (NS) gate, which acts on pho- 
tons in a single optical mode, ao|0) + ai|l) + c*2|2) — > 
ao|0) + o;i |1) — c^2 12) , the maximum success probability 
without feed-forward has been theoretically proved to be 
1/4 [6]. Here we focus on more complicated gates, tak- 
ing as examples the two-qubit controlled sign (CS) gate 
( equivalent ly, the CNOT gate), and the three-qubit Tof- 
foli gate. For these physically important gates, existing 
theoretical results are limited to upper or lower bounds 
on the success probability [TJ [71 [8] . 

A linear-optical quantum gate, or state generator (LO- 
QSG) [5 , can be viewed formally as a device imple- 
menting a contraction transformation (for ideal detec- 
tors) that converts pure input states into desired pure 
output states. The goal of the optimization problem is to 
find a proper linear optical network (see Fig. 1), charac- 
terized by a unitary matrix U, that performs the desired 
transformation [9l [10] . The problem is naturally parti- 
tioned into two tasks: i) finding a subspace of perfect 
fidelity within the space of all unitary matrices U, and 
ii) maximizing the success probability within this sub- 
space. While in this paper we address transformations 



implemented by linear optics, the method is universal 
and with minor modifications can be successfully applied 
to any quantum-information problem involving unitary 
operations combined with measurements. 
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FIG. 1: A general measurement- assisted transformation (e.g., 
a quantum logic gate or a linear-optical quantum state gen- 
erator) . 

Originally, the linear-optical device was envisioned as 
a network of linear-optical elements [11] , as for example 
in the original KLM scheme, where the CS gate is con- 
structed as a combination of two NS gates [4] . In practice, 
a functional microchip-based device may instead consid- 
ered as an integrated light circuit [TT] . performing one 
large operation, as shown schematically in Fig. [I] Here 
the input state |tf in ) = |^^ mp )(g)|^f n ncina )(g)|^ vacuum ) is a 
product of a computational input state, an ancilla state, 
and possibly a vacuum state. Assuming dual-rail encod- 
ing, the computational state will consist of M c photons in 
N c = 2M C optical modes encoding an arbitrary state of 
M c qubits, e.g., the logical two-qubit state | |) | T) mav 
be represented in four optical modes by |li, O2) ® 1 13 , O4) 
in the Fock basis. The ancilla input state of M a photons 
distributed over N a modes may be a separable state, an 
entangled state, or even half of an ebit state carrying spa- 
tially distributed entanglement, as required for example 
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in entanglement-assisted error correction [2j [12] . Finally, 
N v auxiliary vacuum modes contain zero photons in the 
initial state. 



The core of the device is the transformation a, 
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of the photon creation operators between the 
input and output states. Here U, which contains all phys- 
ical properties of the device, is an N x TV unitary matrix, 
where N — N c + N a + N v is the total number of modes. 
The matrix U associated with the physical device induces 
a transformation ft acting on the input state, where ft is 
a high-dimensional irreducible representation of U [13]. 
Writing the total input state in the Fock representation 
as |^in) = |ni,n 2 , . . . ,n N ), where = M c + M a is 

the total number of photons, ft takes the form 
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Next, a measurement is applied to the N — N c an- 
cilla and vacuum modes. This measurement is formally 
described by a Kraus POVM operator acting on these 
modes only: P = |0jv c +i, 0iv c +2, • • • , 0n)(& measured • 
In the most natural case of a photocounting measure- 
ment, (^measuredl = (&N C + 1 , ^N c +2 , • • • , k N | , where ki 

is the number of photons measured in the i-th mode. 
Finally, the resulting transformation of the computa- 
tional state is a contraction quantum map l^u™ 13 ) = 

A|*?r p >/iiAi*r 



A(U)|* 



comp\ 



° mp )|| [14 , where A(U) is defined by 

(k Ne +iM c +2,---M\n(v)\*m)- (2) 



The linear operator A contains all the information of 
relevance to the gate or state transformation. 

Now we consider the main properties of Eq. ([2| rele- 
vant to the optimization problem. In the Fock basis, the 
matrix A is a submatrix of the larger matrix ft, and in ac- 
cordance with Eq. ([!]) , matrix elements of A are given as 
polynomials of degree M c + M a in the matrix elements of 
U. For example, the matrix element for transforming the 
two-qubit computational state | |) <g) | |) into | T) ® I |), 
with two ancilla photons in modes 5 and 6, is 



(I1O2O3I4IAII1O2I3O4) 



31,32,33 ,34 

permutations (1,4,5,6) 



U hh U 3j2 U 5,33 U 6,, 



(3) 

More generally, in the Fock representation, all matrix 
elements are calculated as permanents of U [5] . 

Furthermore, if the total number of measured photons 
X^Ar c +i hi equals the number of input ancilla photons 

Y^i=N c +i n ^'> t nen Eq. ([i]) leaves the number of compu- 
tational photons invariant. The dual-rail computational 
basis is a subset of all possible states of M c photons in 
the 2M C computational modes. Thus, the transformation 



matrix A is a rectangular matrix, mapping the computa- 
tional Hilbert space, of dimension 2 Mc , to a larger Hilbert 
space, of dimension (3M C - 1)!/(2M C - 1)!(M C )!. 

We now define precisely the operational fidelity of a 
transformation, which in general differs from the com- 
mon measure of fidelity for a state transformation [T5] . 
Physically, the transformation A has 100% fidelity if it is 
proportional to the target transformation operation A t , 
i.e., A = gA t , where g is an arbitrary complex number 
(in which case S — \g\ 2 is the success probability of the 
transformation [16]). In general, we may consider com- 
plex rays /3\A and #2 At, /^i,^2 C C as elements of a 
complex projective space, and define the fidelity as 



F(U) 



(A|A t )(A t |A) 
(A I A) (A t I A t ) 



(4) 



where A = A(U) is defined by Eq. The Hermitian 
inner product is (A|B) = Tr(A^B)/D c , and D c = 2 Mc is 
the dimension of the computational space. F is closely 
related to the Fubini-Study distance 7 = cos -1 (\/F) [T7] . 
but for numerical computations F has the advantage of 
being non-singular near F = 1. 

In general, the success probability S depends on 
the initial state |\^° mp ). S is bounded above by the 
square of the operator norm, ||A|| 2 = {A}^ ax = 
Max((vl/£ ip |AtA|vl/£ ip )) ) and below by {A}l in = 
Min((^|^ mp | A^A|^|^ mp )), where the maximum and 
minimum are taken over the set of properly normalized 
input states. As a more convenient measure, we use the 
norm (A|A). It is easy to verify that {A}^ in < (A|A) < 
{A}^ ax . As fidelity F -> 1, {A} min /{A} max -> 1 and 
S becomes independent of the initial state. We refer to 
S = (A|A) as the success probability, keeping in mind 
that it may not correspond to the success probability for 
every initial state, except in the case of perfect fidelity. 

Once the success rate S(U) = (A(U)| A(U)) and fi- 
delity F(JJ) have been constructed for a given target 
transformation and given ancilla resources, the task is 
to find the unitary matrix U that maximizes *S(U) on 
the constraint set F(U) = 1. We parametrize U = 

ex P (52jLi where Hj is a complete set of com- 

plex anti-Hermitian N x N matrices, and find a local 
maximum of F in x space. When the optimization has 
converged, we check whether F(XJ) = 1 to numerical ac- 
curacy, in which case we proceed to find a local maximum 
of success probability S(XJ) on the F(XJ) = 1 surface. Re- 
peating the process with multiple randomly chosen start- 
ing points, we obtain the best S, which yields the optimal 
design for the quantum circuit. 

We first apply our approach to the CS gate. For an 
arbitrary two-qubit gate, A(U) and A t are 10 x 4 ma- 
trices; for the CS gate matrix A t is determined accord- 
ing to the action of the gate 1 0101) — > — a|0101), and 
l^in mP ) c ^in° mP > f o r (0101|^ n omp ) = 0. The vari- 
able a is an arbitrary non-zero complex constant, in 
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agreement with the standard definition of a projective 
complex space. Here, one easily checks that the mini- 
mum number of unentangled ancillas needed to obtain 
perfect fidelity is N a — 2, so that U is a 6 x 6 matrix 
(N = N c + N a = 4 + 2 = 6). In this case, we find that 
the second optimization stage is unnecessary, i.e., S(U) 
is a constant on every F = 1 manifold (each such mani- 
fold consisting of an equivalence class of matrices differ- 
ing only by phase factors). Several inequivalent F = 1 
manifolds are found. The best solutions have S = 2/27, 
corresponding to an analytic solution found previously 
by Knill [8]. Due to the complexity of the CS gate, it 
is not known if an analytical proof for determining the 
maximum success probability is possible. Our numerical 
evidence, however, strongly indicates that Knill's solu- 
tion is indeed the global maximum. 

Can the solution be improved by adding N v vacuum 
modes to the device? This question may be answered 
straightforwardly by repeating the above optimization 
with (N c + N a + N v ) x (N c + N a + N v ) unitary matrices 
U, for various values of N v . However, there exists an 
alternative "unitary dilation" approach [5]. 

The most general device design, allowing for an arbi- 
trary number of vacuum modes, is obtained by allowing 
U to be an arbitrary complex (N c + N a ) x (N c + N a ) 
matrix, and replacing U — > U/||U||, which scales the 
maximum singular value to unity. The expression Q 
for fidelity as a function of U is invariant under scaling, 
while the generalized success function for a nonunitary 
U is given by 



0.08 



CS gate implemented with 2, 3 and 4 ancillas 



5(U) = 5(U)/(||U| 



n2(M c +M„) 



(5) 



S(XJ) has a discontinuous gradient whenever the largest 
singular value of U goes through a double or higher-order 
degeneracy. Of particular interest is the fact that S, while 
well-behaved on the manifold of unitary U, has a singular 
cusp-like structure in the neighborhood of this manifold. 

The result of a non-unitary optimization for the CS 
gate with two ancillas is shown in Fig. [2] (circles). 
Plateaus are clearly visible, corresponding to local max- 
ima of the success rate. Indeed, we have continued the 
optimization for a much larger number of iterations with 
higher numerical accuracy for data points characterized 
by S > 0.04. In doing so, we found that the suc- 
cess rate always converges either to the Knill solution 
(S = 2/27 ~ 0.074) or to one of two local maxima: 
S « 0.047 and S = 1/16 = 0.0625. The Knill solution has 
the largest basin of attraction, while the basin of attrac- 
tion for S = 1/16 is the smallest. We note that the KLM 
scheme [4] produces a success rate S = 1/16, though 
the matrices obtained numerically have a structure dif- 
ferent from the KLM form. The pronounced plateau at 
2/27 provides strong numerical evidence that Knill's so- 
lution (which makes no use of vacuum modes) is globally 
optimal, even when vacuum modes are allowed. It ap- 
pears that the cusp-like structure of the success rate ([5| 
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FIG. 2: The optimized success probability for the CS gate 
is shown for two, three, and four ancillas, and an arbitrary 
number of vacuum modes. Each point indicates a complete 
run starting from a randomly chosen starting matrix U. The 
success rates are arranged in ascending order, so that the 
horizontal axis may be viewed as a cumulative frequency. The 
2/27 ~ 0.074 success rate found by Knill [8] is indicated by a 
horizontal line. 



strongly favors maxima appearing at unitary values of 
U (where all singular values become degenerate at 1), 
and indeed the global maximum corresponds to one such 
unitary matrix: the Knill matrix. 

Interestingly, analytical fidelity-preserving transfor- 
mations, constructed by extending the Grobner basis 
method [5], can explain seven dimensions of the F = 1 
subspace, while direct numerical tests reveal that this 
subspace is 11-dimensional in the vicinity of the Knill so- 
lution. Thus, there exist hidden symmetries, which, we 
believe, can be identified only by more powerful mathe- 
matical methods from the repertoire of algebraic geome- 
try. 

Next, we investigate the effect that additional ancilla 
resources may have on the optimization problem. Pre- 
viously, an upper bound for the success probability with 
unentangled resources was shown to be 3/4 [7 j. Repeat- 
ing our optimization procedure in larger matrix spaces 
associated with three and four ancillas (Fig. |2|, we find, 
surprisingly, that the global and local maxima are un- 
changed. This suggests that the minimum resources 
needed to produce the CS gate with perfect fidelity (i.e., 
two unentangled ancillas with no vacuum modes) also 
suffice to produce the best possible success rate. In view 
of the fact that exactly the same behavior of success prob- 
ability has been found for the NS gate [6 , one may ex- 
pect that this may be a universal property of probabilistic 
(photonic) gates: the maximal success probability is at- 
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tained with minimal required resources. We tested this 
conclusion also for the Toffoli gate, which we discuss be- 
low in detail, finding that adding one more ancilla to the 
required minimum of three ancilla photons also does not 
affect any of two local maxima, in full compliance with 
the suggested rule. 
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FIG. 3: The distribution of success rates for the Toffoli gate. 

Now we consider the three-qubit Toffoli gate. After a 
local Hadamard rotation, the standard Toffoli gate acts 
as a "sign" transformation: |010101) — > a|010101), and 
l^in° mP ) -^in° mP ) fOT (010101|^ n omp ) = 0. We first 
check that a minimum of three ancillas are needed for 
perfect fidelity. Thus N = N c + N a = 6 + 3 = 9. 

To reduce the size of the parameter space, and improve 
the convergence of the success optimization, we consider 
the following ansatz for U: Uij = Uji = 5ij for i = 2, 
4, 6, i.e., U is designed to act non-trivially only on the 
computational modes 1, 3, and 5. 

The results of an optimization over 9x9 subunitary 
matrices are shown in Fig. [3J The best solution obtained 
is S ~ 0.00340; This is an improvement over combining a 
CNOT gate, a CS gate, and a "passive quantum filter" to 
produce the Toffoli gate pQ, which yields a total success 
rate S = (2/27) 2 x 1/2 « 0^00274 using four unentangled 
ancilla photons. 

Of practical interest is our finding that optimization 
in the full 10 x 10 matrix space is much more efficient 
than in 13 x 13 unitary space, even though the optimal 
N v = 3 solution is an element of both. The space of 
unitary U contains many local maxima of the success 
rate, preventing the global maximum from being reached. 
Eliminating the unitarity constraint creates passageways 
connecting the local maxima to the global one. 

In this work, we have provided numerical evidence 
that the previously obtained solution for the CS (CNOT) 
gate, with a success probability S = 2/27, is optimal, 
and cannot be improved by adding ancillas or auxil- 



iary vacuum modes. On the other hand, for the Tof- 
foli gate we show a new solution, which surpasses what 
has been obtained analytically using unentangled ancillas 
(our solution provides a higher success probability using 
fewer resources). This result is a proof of principle of 
successful numerical optimization in linear optical quan- 
tum information processing. Future directions that nat- 
urally suggest themselves include: optimal implementa- 
tion of two- mode biphotonic qutrit [18] gates, operations 
on multi-rail encoded qudits using angular momentum 
photons [19] , design of gates that are robust to noise and 
photon loss, and optimization in the context of error- 
correcting codes [2]. 
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